Our first graph
New image here
Step 1
Tidy data by Whickham
Step 2:
Practice Exercise 1: Can you do this for the pine data we have collected?
Step 3
Step 4
Step 5
Cleaning Data -
Step 6
A good way to organize script files is number them in the order they get run.
The important considerations in data
USE CONTROLLED VOCABULARY AND BE CONSISTENT THINK BEFORE DOING –> WHAT HAPPENS DOWN THE ROAD
Broman KW, & Woo KH. 2018. Data organization in spreadsheets. The American Statistician 72: 2-10 (HERE)
example of fish data
Practice Exercise 3: How do we fix variable names?
So there are two issues
# lets do # 2 first - no pun intended
# if you wanted to rename variables what would would you do?
# now time for #1 - there are tools to make your life easier
# install.packages("janitor") # what does a janitor or BSW do?
# library(janitor)
# lets read in a messy file... junk.csv
# first look at the file
# df <- read_csv("data/junk.csv)
# df_excel <- read_excel("data/junk.csv")_is 01 04 2025 the same as 04 01 2025
what are the dates in english?
or European
Practice Exercise 3: Are there ways to deal with excel and why is it a problem
What is data about
Who collected it
When
Where
Funding agency
Methods used to collect
Variable names
description
units
abbreviations
CALCULATIONS AND WHY?
We need to know what happened and why and the units and WTF it means?
TGW - yep its a thing
ODO - what do you think it is?
NO3 - what is it? Are you sure? Why might you get in legal trouble if you used this?